The author discusses information retrieval systems
with particular emphasis on the assignment of descriptor terms. She
first explains the method of determining terms by deriving key words
from a title or abstract, a procedure that can often be carried out
by a computer. After discussing the problems of derived indexing, the
author explains the procedure of assigning terms to an article, a
slower process that requires indexers who are familiar with the
discipline. This provides greater control over the information. The
author then describes the efforts of an ad hoc committee of the
Speech Communication Association to organize and begin to establish a
retrieval system useful for researchers in the communications
discipline. (RN)
Information Available from Various Formats to Retrieve Data

In December 1967 an Ad Hoc Committee on Information Retrieval was
established by the Speech Association of America in an attempt to investigate
the usefulness of an Information Retrieval System for research conducted by
members of the organization. The immediate problem was to determine how
best to organise and assemble the information available so that the resulting
retrieval system would be a helpful research aid.

Most retrieval systems are based on a list of index or descriptor terms
which are associated with an article in some way, but the procedure for
determining these descriptor terms falls roughly into two main camps. The
first is to scan the articles to be used as the data base and extract from
them terms which seem to be relevant to the main theme. Such terms may range
from words lifted out of the title which seem to indicate the subject matter,
to words taken from an abstract or the article itself. This method is simple
and may be applied by indexers who have no specific knowledge of the discipline.
For this reason, of course, the work can be carried out by machines. All one
needs to do is specify certain common words (usually function words) which should
not be used, and a computer can be programmed to index the material.
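
The procedure just described can be sketched in a few lines of Python. This
is only an illustration under stated assumptions: the stop list and the
function name derive_terms are hypothetical, and a working system would need
a far longer list of function words.

```python
import re

# Hypothetical stop list of function words (an assumption for illustration).
STOP_WORDS = {"a", "an", "and", "in", "of", "on", "the", "to"}

def derive_terms(title):
    """Return the lowercased words of a title that are not on the stop list."""
    words = re.findall(r"[a-z]+", title.lower())
    return [w for w in words if w not in STOP_WORDS]

print(derive_terms("The Effects of Attitude Change on an Audience"))
# ['effects', 'attitude', 'change', 'audience']
```

Nothing in this sketch judges whether the surviving words actually represent
the content of the article.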

However there are serious problems with the above method of derived
indexing, the most important of which is that it is difficult to control the
terminology. For example, it is often unwise to assume that a title of an
article gives an accurate summary of its content. Titles are often 'vague'
or deliberately 'eye-catching'. They may well succeed in attracting attention,
but are of little use in a retrieval system. Similarly author abstracts may
not always provide a true reflection of the content of a paper but more of the
paper the author would like to have written. Even if terms are taken directly
from the article, as they are in a KWIC (Key Word in Context) type of indexing,
there is still no guarantee that the content will be accurately represented.
Here of course the problem is the individuality of authors. Any discipline
contains many terms which are interchangeable and the use of an alternative
depends on individual preference. In the field of Speech Communication such
a term seems to be "attitude change" versus "opinion change". By this method
of indexing it would be impossible to relate two papers which used these terms
separately, even though they were referring to the same topic. This matter of
synonymy is an important one to which we will return later.
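
A minimal sketch of KWIC-style indexing shows why the two papers would never
meet. The titles, the stop list, and the function name kwic are hypothetical;
the rotation shown is one common way of presenting a keyword in its context,
not a description of any particular system.

```python
def kwic(titles, stop_words):
    """Build a Key Word in Context index: one entry per non-stop word per title."""
    entries = []
    for title in titles:
        words = title.split()
        for i, word in enumerate(words):
            key = word.lower()
            if key in stop_words:
                continue
            # Rotate the title so the keyword comes first; "/" marks the wrap point.
            context = " ".join(words[i:] + ["/"] + words[:i])
            entries.append((key, context))
    return sorted(entries)

# Hypothetical titles using the two interchangeable terms.
index = kwic(["Attitude Change in Small Groups", "Measuring Opinion Change"], {"in"})
for key, context in index:
    print(f"{key:10} {context}")
```

One paper files under "attitude" and the other under "opinion"; only the
shared word "change" brings them together, along with everything else that
mentions change of any kind.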

Another important problem is that it is essential to distinguish between
words and phrases used as descriptors. In other words, the phrase "language
analysis" is a different retrieval term from the terms "language" and "analysis"
used separately. This is something which has to be considered when basing a
system on retrieval from descriptors derived from the content, since one would
probably receive very different information in each case.
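
The difference between a phrase and its separate words can be made concrete
with a short sketch. The sample sentence and both function names are
hypothetical.

```python
def matches_phrase(text, phrase):
    """True if the words of `phrase` occur adjacent and in order in `text`."""
    words = text.lower().split()
    target = phrase.lower().split()
    n = len(target)
    return any(words[i:i + n] == target for i in range(len(words) - n + 1))

def matches_all_words(text, phrase):
    """True if every word of `phrase` occurs somewhere in `text`, in any position."""
    words = set(text.lower().split())
    return all(w in words for w in phrase.lower().split())

doc = "an analysis of body language in debate"  # hypothetical abstract text
print(matches_all_words(doc, "language analysis"))  # True
print(matches_phrase(doc, "language analysis"))     # False
```

A search on the two separate words retrieves this item; a search on the
phrase does not.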

The second procedure for determining descriptor terms is to assign
them to an article by scanning it and deciding upon a number of terms which
seem to indicate the main theme. At once one can see several major differences
from the derived method of indexing. In the first place, it is usually essential
for indexers to be familiar with the discipline since they will need to make
judgments about the material. This of course immediately slows the process
considerably, and it is impossible for machines to do this work. But the main
advantage is that far greater control over the information is possible. It
eliminates the possibility that a title, or even words from the text, may not
be representative. It also provides an efficient way of dealing with the
"attitude/opinion" problem mentioned above since the indexer may choose one of
these terms and apply it consistently in every case. The criterion for the
choice would usually be the popularity of one term over the other. Then once
the decision is made the fact that the two phrases are regarded as synonymous
could be indicated by attaching such terms as 'use' and 'used for' to both terms,
e.g. if the decision is made in favor of "attitude change," the thesaurus entries
would read:

ATTITUDE CHANGE
     UF  Opinion Change

OPINION CHANGE
     Use Attitude Change

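
The pair of entries above can be sketched as a small lookup table. The names
USE, preferred and used_for are hypothetical; only the two terms themselves
come from the example.

```python
# Each entry maps a non-preferred term to the preferred term it points to ("Use").
USE = {"opinion change": "attitude change"}

def preferred(term):
    """Return the preferred descriptor for `term`, or the term itself if none is recorded."""
    key = term.lower()
    return USE.get(key, key)

def used_for(term):
    """Return the non-preferred terms that point at `term` (its "UF" list)."""
    key = term.lower()
    return sorted(t for t, p in USE.items() if p == key)

print(preferred("Opinion Change"))   # attitude change
print(used_for("Attitude Change"))   # ['opinion change']
```

Storing only the "Use" direction and deriving "UF" from it keeps the two
entries from drifting apart.
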
Once such a problem has been faced and resolved it is possible to see
how one arrives at the concept of a thesaurus (or authority list). A thesaurus
has been described as "a term-association list structured to enable indexers
and subject analysts to describe the subject information of a document to a
desired level of specificity at input, and to permit searchers to describe in
mutually precise terms the information required at output." Here one has the
greatest degree of control over the information since one has the potential to
organise the descriptor terms to show generic or hierarchical relationships, to
offer definitions of terms where necessary, to indicate syntactic and synonymous
relationships, and generally to offer the user the widest possible help in
searching for information.

At this point we can more easily appreciate the problems faced by the
Information Retrieval Committee of the SCA. It is difficult to say that one
approach is better than another because different subject areas require different
approaches. A helpful summary of points to consider is provided by Charles
Bourne (1965):

1. Type of ultimate user (the users vary in needs, habits, and approaches).

2. Type of immediate user (librarian or customer).

3. Characteristics of the file collection (current and expected size,
   rate of growth, variety and complexity of subject content, and format
   of file material).

4. Availability of user existing indexers for this same type of file material.

5. Complexity and required accuracy of searches to be conducted (current
   awareness, comprehensive retrospective search).

6. Number of searches expected, and their required response time.

7. Current user and librarian attitudes toward the existing indexing system
   or form of display.

8. Resources available for developing the system, converting the backlog
   of material to the new system or new method of display, and maintaining
   the routine operation.

As always, the most important consideration was the last one, partly
because it was impossible to answer such questions as the first and second
as the development was only to be a pilot project which would be assessed after
a trial run.

With this vague brief, the committee had to decide how to approach the
field of speech communication so as to produce a retrieval system most useful to
researchers in the discipline. It was impossible to scan all journals to be
used in the final data base so it was decided to sample the literature of the
field to ascertain what were the key concepts used by researchers. The initial
sample was taken from nine journals immediately connected with SCA. In
retrospect this may have been a mistake because only four of the nine journals,
namely, The Quarterly Journal of Speech, Philosophy and Rhetoric, Journal of
Communication and Speech Monographs, could be designated as "scholarly" journals
with any consistency at all. Since a retrieval system is eventually only as good
as its data base, this may have contributed to the premature decision of SCA not
to continue the project.

Nevertheless the title and summary (or first and last 100 words) of
randomly selected articles in the journals from 1965-1968 with the abstracts
of the 1968 SCA Convention were subjected to computer analysis, and a concordance
of all the words in the base was produced. Then, because of the above mentioned
limitations of terms taken from context, the committee searched the concordance
manually for acceptable descriptor terms. This resulted in a list of 1,100
terms which was further supplemented by adding the terms from selected author
abstracts, plus the abstracts from the 1969 and 1970 SCA conventions.
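
A concordance of the kind described can be sketched as follows. This is an
assumption about the general technique, not a record of the committee's
actual program; the two sample texts and the function name are hypothetical.

```python
from collections import defaultdict
import re

def concordance(documents):
    """Map each word to the sorted list of document ids in which it occurs."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for word in re.findall(r"[a-z]+", text.lower()):
            index[word].add(doc_id)
    # Alphabetical order makes the list easy to scan by eye.
    return {word: sorted(ids) for word, ids in sorted(index.items())}

# Hypothetical title-and-summary texts keyed by article number.
docs = {1: "Rhetoric and persuasion", 2: "Persuasion in broadcasting"}
for word, ids in concordance(docs).items():
    print(word, ids)
```

The output lists every word, function words included, which is why a manual
pass for acceptable descriptors still had to follow.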

With such a small data base it was possible to use it as a basis for
developing a thesaurus. It would almost certainly be possible to organise the
descriptor terms in such a way as to reflect the central concepts of the discipline,
attempting to show the relationships between them. To this end, every term in
the preliminary list was examined in relation to every other keyword. The process
of building the Thesaurus and the terms through which the relationships are
structured (Broad Term, Narrow Term, Related Term, etc.) are described in
Borden, Jenkins and Stone (1972).

Also described in that article is the concept of a Faceted Thesaurus
which is seen as "a grouping of descriptor terms which fall into the same
general concept area" (p. 13). This is a most useful method of dealing with
some of the semantic problems in information retrieval. For example, it is
obvious that the structure of a thesaurus will reflect the point of view of
its builders, and that others may not necessarily agree with their classification
scheme, but when one has access to a facet, it is possible to see connections
between terms at widely different levels on the 'tree' hierarchy. In this way,
any individual bias resulting from an initial level of simple Broad Term, Narrow
Term relationships tends to be minimized. What one sees, in fact, is that the
field seems to break down into distinct conceptual areas, such as Rhetorical
[illegible], [illegible] and Broadcasting.

It is also possible to see that the approach to meaning is purely
situational. In other words, terms are used only as they are found in the
literature, and any other meaning of a word or phrase is ignored. An example
of this can be seen in the use of the descriptor term "interpretation." In
the Thesaurus this means only "oral interpretation" in connection with Readers
Theatre, and has nothing to do with translating from one language into another.
Similarly 'abstract' is used only in the sense of 'information retrieval'.
In a broader sense we might examine the term 'Civil War' to note that subordinate
terms are 'abolition' and 'Ante Bellum'. In other words, it is only the American
Civil War which is relevant, and no other.
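
The subordinate-term relationship in the Civil War example can be sketched as
Broad Term and Narrow Term links. The data structure and function names are
hypothetical; only the three terms are taken from the example.

```python
# Narrow Term (NT) links, using the subordinate terms named in the example.
NARROWER = {"civil war": ["abolition", "ante bellum"]}

def broader(term):
    """Return the Broad Terms (BT) of `term`, derived by inverting the NT links."""
    return sorted(bt for bt, nts in NARROWER.items() if term in nts)

def expand(term):
    """Return `term` followed by every term beneath it in the hierarchy."""
    result = [term]
    for nt in NARROWER.get(term, []):
        result.extend(expand(nt))
    return result

print(expand("civil war"))   # ['civil war', 'abolition', 'ante bellum']
print(broader("abolition"))  # ['civil war']
```

Because the narrower terms fix the sense of the broad term, a search expanded
in this way can only reach material on the American Civil War.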

Once the Thesaurus was structured, it was then necessary to return to the
literature to see whether the system would work. By this time the co-operation
of individual authors and of the journal editors had been obtained in that they
agreed to complete the Journal Abstract form of the MLA, supplying a short
abstract plus nine index terms which were felt to be most descriptive of an
article. Generally, the original article was also scanned to ensure that the
central themes were represented in the abstract.

In deciding which author generated descriptor terms to include in the
Thesaurus one of the main considerations was always the depth of indexing
possible. Often authors offered terms as broad as "communication" or as narrow
as "allophone" for an article in the field of "oral interpretation". Many times,
of course, the depth of indexing will only reflect the past and present importance
of a topic in the Speech discipline. For example, the area of "rhetorical
analysis" will be quite exhaustively indexed, as an examination of that facet will
prove. On the other hand, a term such as 'para-message' is relatively new in
the field, and not yet 'established.' It will therefore probably not appear in
the Thesaurus. The usefulness of broad terms has been illustrated in the
discussion of facets: it gives one a large area in which to manoeuvre and, by
combining several descriptors, to arrive at a fairly concise delimitation of
one's area of interest.
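
Combining several descriptors amounts to intersecting the sets of articles
indexed under each one. The sketch below assumes a hypothetical index with
invented article numbers; the function name search is likewise an assumption.

```python
def search(index, descriptors):
    """Return ids of articles indexed under every one of the given descriptors."""
    sets = [set(index.get(d, ())) for d in descriptors]
    return sorted(set.intersection(*sets)) if sets else []

# Hypothetical index: descriptor -> article numbers assigned to it.
index = {
    "communication": [1, 2, 3, 4],
    "rhetorical analysis": [2, 3],
    "broadcasting": [3, 4],
}
print(search(index, ["communication"]))
# [1, 2, 3, 4]
print(search(index, ["communication", "rhetorical analysis", "broadcasting"]))
# [3]
```

The broad term alone returns everything; each added descriptor narrows the
result.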

Because the descriptors for an article are assigned to it rather than
derived from it, it follows that the information revealed by a search will be
more abstract than if a search was based on a specific linguistic string
appearing in an abstract. At one point an attempt was made to 'seed' the
abstracts with the actual descriptor terms, but the time needed for such a
task outweighed any possible advantages, since these terms did not necessarily
appear in the body of the article itself. To compensate for this abstractness,
however, a list is attached to the Thesaurus containing such concrete terms as
names of individuals, associations, countries and specific Rhetorical Movements
(e.g. Women's Liberation). This we called a proper term list. By using this
list together with the Thesaurus, it should be possible to achieve a high
degree of precision.

Finally, there is the question of how dynamic the Thesaurus will prove to
be. Of course any authority list runs the risk of stagnating the field, but
the compilers made a conscious effort to regard the Thesaurus as preliminary
at all stages, and therefore open to change. This change was anticipated in
the decision to adopt the MLA form. This puts control in the hands of
researchers, whose articles reflect the developing interests of the discipline.
Wherever possible, author generated descriptor terms were adopted into the
Thesaurus.

There is still much work to be done to improve the system. It works. It
has been demonstrated at the 1971 SCA and 1972 ICA Conventions. Now it needs
a greatly expanded and improved data base. Part of this development is in fact
being carried out in the project reported on from Florida State University. It
would also be interesting to examine the citations used in important papers
and include these articles in the data base.

The Thesaurus needs publishing so that members of the discipline may use
it and offer their knowledgeable suggestions for its improvement. At present,
however, the project has been abandoned by SCA and its future appears uncertain.